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Abstract 

If a cellular automaton (CA) is started with a single ON cell, how many cells will be ON 
after n generations? For certain “odd-rule” CAs, including Rule 150, Rule 614, and Fredkin’s 
Replicator, the answer can be found by using the combination of a new transformation of 
sequences, the run length transform, and some delicate scissor cuts. Several other CAs are also 
discussed, although the analysis becomes more difficult as the patterns become more intricate. 


1 Introduction 

When confronted with a number sequence, the first thing is to try to conjecture a rule or formula, 
and then (the hard part) prove that the formula is correct. This article had its origin in the study 
of one such sequence, 1, 8, 8, 24, 8,64, 24,112, 8, 64, 64,192,... ( A160239 1 ), although several similar 
sequences will also be discussed. 

These sequences arise from studying how activity spreads in cellular automata (for background 
see [2, 5, 8, 11, 14, 17, 20, 21, 23, 24, 26]). If we start with a single ON cell, how many cells will 
be ON after n generations? The sequence above arises from the CA known as Fredkin’s Replicator 
[13]. In 2014, Hrothgar sent the author a manuscript [10] studying this CA, and conjectured that 
the sequence satisfied a certain recurrence. One of the goals of the present paper is to prove that 
this conjecture is correct—see (31). 

In Section 2 we discuss a general class (the “odd-rule” CAs) to which Fredkin’s Replicator 
belongs, and in §3 we introduce an operation on number sequences (the “run length transform”) 
which helps in understanding the resulting sequences. Fredkin’s Replicator, which is based on the 
Moore neighborhood, is the subject of §4, and §5 analyzes another odd-rule CA, based on the von 
Neumann neighborhood with a center cell. Although these two CAs are similar, different techniques 
are required for establishing the recurrences. Both proofs involve making scissor cuts to dissect the 
configuration of ON cells into recognizable pieces. 

Section 6 discusses some other CAs in one, two, and three dimensions where it is possible to 
find a formula, and some for which no formula is presently known. In dimension one, Stephen 
Wolfram’s well-known list [17, 24, 26] of 256 different CAs based on a three-celled neighborhood 
gives rise to just seven interesting sequences (§6.1). Other two-dimensional CAs are discussed in 
§§6.2, 6.3, and the three-dimensional analog of Fredkin’s Replicator in §6.4. The final section (§7) 
gives some additional properties of the run length transform. For many further examples of cellular 
automata sequences, see [2] and [16] (the index to [16] lists nearly 200 such sequences). 

1 Six-digit numbers prefixed by A refer to entries in [16]. 
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Figure 1: Some two-dimensional neighborhoods (Figs, (i) and (iv) are three and five cells wide, 
respectively). 

2 Odd-rule CAs 

We consider cellular automata whose cells form a d-dimensional cubic lattice Z d , where d is 1, 2, or 3. 
Each cell is either ON or OFF, and an ON cell with center at the lattice point u = (u \, U 2 , ■ ■ ■, Ud) £ 
1i d will be identified with the monomial x u = x^x^ 2 ■ ■ ■ x 1 / , which we regard as an element of the 
ring of Laurent polynomials TZ = GF(2)[xi, x^" 1 ,..., Xd, x^ 1 } with mod 2 coefficients. The state of 
the CA is specified by giving the formal sum S of all its ON cells. As long as only finitely many 
cells are ON, S is indeed a polynomial in the variables x t and x ~ l , and is therefore an element of 
TZ. We write u G S to indicate that u is ON, i.e., that x u is a monomial in S. 

In most of this paper we will focus on what may be called “odd-rule” CAs. An odd-rule CA is 
defined by specifying a neighborhood of the cell at the origin, given by an element F £ 1Z listing the 
cells in the neighborhood. A typical example is the Moore neighborhood in Z 2 , which consists of 
the eight cells surrounding the cell at the origin in the square grid (see Fig. l(ix)), and is specified 
by 

„ llxl V 

F :=-1-1-1-hxH-I -y + xy 

xy y y x x 

l+l + aAQ + l + yj-l G U = GF(2)[x,x~ l ,y,y~ 1 ]. (1) 

The neighborhood of an arbitrary cell u is obtained by shifting F so it is centered at u, that is, 
by the product x u F 6 1Z. Given F. the corresponding odd-rule CA is defined by the rule that the 
cell at u is ON at generation n + 1 if it is the neighbor of an odd number of cells that were ON at 
generation n, and is otherwise OFF. 

Our goal is to find a n (F), the number of ON cells at the nth generation when the CA is started 
in generation 0 with a single ON cell at the origin. For odd-rule CAs there is a simple formula. 
The number of nonzero terms in an element P G 7Z will be denoted by \P\. 
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Theorem 1. For an odd-rule CA with neighborhood F, the state at generation n is equal to F n , 
and an(F) = \F n \. 

Proof. We use induction on n. By definition, the initial state is 1 = P°, and ao(F) = 1. The ON 
cell at the origin turns ON all the cells in F, so the state at generation 1 is F itself, and a\(F) = |P|. 
Suppose the state at generation n is F n . An ON cell w G F n will affect a cell u if and only if u is 
in the neighborhood of w, that is, if and only if u G wF. For u to be turned ON, there must be an 
odd number of cells w G F n with u G wF. Since the coefficients in 7 Z are evaluated mod 2, u will 
be turned ON if and only if u G wF = PX^eF n w = F-F n = F n+1 . So F n+1 is precisely 

the state at generation n + 1, and a n+ i(F) = |P n+1 |. □ 


3 The run length transform 


We define an operation on number sequences, the “run length transform”. For an integer n > 0, 
let £(n) denote the list of the lengths of the maximal runs of Is in the binary expansion of 
n. For example, since the binary expansion of 55 is 110111, containing runs of Is of lengths 
2 and 3, £(55) = [2,3]. £(0) is the empty list, and £(n) for n = 1,..., 12 is respectively 

[1], [1], [2], [1], [1,1], [2], [3], [1], [1,1], [1,1], [1, 2], [2] ( A245562 ). 

Definition. The run length transform of a sequence [S n ,n > 0] is the sequence [T n ,n > 0] given 
by 

Tn = W Si. (2) 

i£C(n) 

Note that T n depends only on the lengths of the runs of Is in the binary expansion of n, not on the 
order in which they appear. For example, since £(11) = [1, 2] and £(13) = [2,1], T\\ = T 13 = S\ S- 2 - 

Also To = 1 (the empty product), so the value of So is never used, and will usually be taken to be 

1. Further properties and additional examples of the run length transform will be given in §7. See 
especially Table 4, which shows how the transformed sequence has a natural division into blocks. 
Define the height ht(P) of an element P G 1Z to be the maximal value of |ej| in any monomial 
‘ ’ ’ x< ct i n P- ^ ht(P) = h, the cells in P are contained in a d-dimensional cube centered at the 
origin with edges that are 2h+l cells long. Note that ht(PQ) < ht(P)+ht(Q) and ht(P fc ) = k ht(P). 
The second property that makes odd-rule CAs easier to analyze than most is the following. 

Theorem 2. If ht(P) < 1. then [a n (F),n > 0] is the run length transform of the subsequence 

[ao(P),ai(P),a 3 (P),a 7 (P),ai 5 (P),.. ,,a 2 k_ 1 (F),...}. (3) 


Proof. The proof depends on the identity sometimes called the Freshman’s Dream, which in its 
simplest form states that ( x + y ) 2 = x 2 + y 2 mod 2, and more generally that for P(aq, xf 1 , ...)gK, 


p(x\,x 1 1 ,.. .y~ = p(x f 


_ 2 /c 


...), 


(4) 


for any integer k > 0, and in particular that |P(aq, x l x ,.. .) 2fe | = |P|. Suppose first that the binary 
expansion of n contains exactly two runs of Is, separated by one or more 0 s, say 


n 


m 1 777-2 7773 

1 oo^~o 


> 1 , 


i.e., 

n = ( 2 mi - l) 2 ma+m3 + ( 2 m3 - 1 ), 
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with C(n) = [ 7711 , 7713 ]. Then 


_ l C1 2 m l-l',2 m 2+ m 3^ ? 2 m 3-l _ p 2 m 2+ m 3 


F n = (F 


= P A 


'Q (say), 


(5) 


where P := F 2 "' 1 ^ 1 , Q := F 2m3_1 . Equation (5) states that F n is a sum of copies of Q centered 
at the cells of of p2 m 2+ m 3. gy the Freshman’s Dream, p 2, " 2 ~ r " 3 i s a polynomial in the variables 
xi t2m2+m 3 ) so the cells in p 2 "" 2 ~"" 3 are separated by at least 2 m2+m3 . Also, |p 2 ’" 2+mi | = |p|. On 
the other hand, since ht(-F) < 1, ht(Q) < 2 m3 — 1, and since 

2(2 ms - 1) + 1 < 2 m3+1 < 2 TO2+m3 


the copies of Q in F n are disjoint from each other, and so \F n \ = |P||Q|, or in other words 

a-n(F) = a2 m i-i(F) a2™s-i{F) = ]^[ a 2 i_ 1 (i ? ). 

i&C(n) 

It is straightforward to generalize this argument to the case when there are more than two runs of 
Is in the binary expansion of n, and to establish that for any n, 

a n {F) = a 2 i_i(F), (6) 

i£C(n) 

thus completing the proof. □ 

In several interesting cases the subsequence (3) satisfies a three-term linear recurrence, in which 
case there is also a simple recurrence for the run length transform. 

Theorem 3. Suppose the sequence [S n ,n > 0] is defined by the recurrence S n+ 1 = C 2 S n + 
c^S n -i,n > 1, with So = 1, S\ = c\. Then its run length transform [T n ,n > 0] satisfies the 
recurrence 

T 2 t = T t , T 4t+1 = c\T t , T^t +3 = c 2 T 2t+1 + cfiTt, (7) 

for t > 0, with To = 1. 

Proof. T 2 t = Tf is immediate from the definition of the run length transform, since C(2n) = C{n). 
The binary expansion of At + 1 ends in 01, so T 4 i +1 = T t Si = c\T t - If t = 2 k - 1 for some k > 1 
then T 4 i + 3 = Sk +2 = ( pSk+\ + c^Sk, T ± t +1 = c\Sk, T 2t+ 1 = 5^+1, implying 


74t+3 = c 2 T 2t+ i + csT t . (8) 

On the other hand, if t has a zero in its binary expansion, say t = i.2 k+1 + ( 2 k — 1 ),k > 0, then 
74t+3 = TiSk+2 = Ti(c2S k + 1 + c 3 S k ), T At+l = TidSk, T 2t +i = 7)5fc+i, and again (8) follows. □ 


4 Fredkin’s Replicator 

The cellular automaton known as Fredkin’s Replicator [7, 8, 15] is the two-dimensional odd-rule CA 
defined by the Moore neighborhood F shown in Fig. l(ix) and Eq. (1). (This is the eight-neighbor 
totalistic Rule 52428 in the Wolfram numbering scheme [17, 24, 26].) 

We study the evolution of this CA when it is started at generation 0 with a single ON cell 
at the origin. Generations 0 through 8 are shown in Fig. 2. The name of this CA comes from 
the fact that any configuration of ON cells will be replicated eight times at some later stage. For 
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Figure 2: Generations 0 through 8 of the evolution of Fredkin’s Replicator. ON cells are black, 
OFF cells are white. 


Table 1: Number of ON cells at nth generation of Fredkin’s Replicator (A160239). 


n 

On 

0 

1 












1 

8 












2-3 

8 

24 











4-7 

8 

64 

24 

112 









8-15 

8 

64 

64 

192 

24 

192 

112 

416 





16 - 31 

8 

64 

64 

192 

64 

512 

192 

896 

24 

192 

192 

576 112 896 416 1728 

32 - 63 

8 

64 

64 

192 

64 

512 

192 

896 

64 

512 

512 

1536 ... 


example, generation 1 is replicated eight times at generation 5. Although distinctive, the name is 
not especially appropriate, since by (4) any odd-rule CA has a similar replication property. Let 
a n (F ) = a n denote the number of ON cells at the nth generation. The initial values of a n are 
shown in Table 1. 

Since ht(F’) = 1, we know from Theorem 2 that [o n ,n > 0] is the run length transform of the 
subsequence [b n = 02 «-i, n > 0] = 1, 8, 24,112,416,1728,... (shown in bold in Table 1; it will turn 
out to be A246030). The main result of this section is the identification of this subsequence. 
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Theorem 4. The sequence [b n ,n > 0] satisfies the recurrence 


b n+ 1 = 2 b n + 86 n _i, with b 0 = 1, &i = 8 . (9) 

Proo/. Let := P n , H n := G 2 «-i = P 2 ”" 1 . (Figure 2 shows G 0 = H 0 , Gi = Pi, G 2 , G 3 = H 2 , 
G 4 , G 5 , Gq, G 7 = H 3 , and Gs-) By definition, 

Pn+l = P 2 ”Pn, (10) 

and, from Theorem 1 , a n = |G n |, b n = |P n |. 

Since P has diameter 3, the nonzero terms x l y 3 in H n satisfy 

— ( 2 n — 1 ) <i,j< 2 "-l, ( 11 ) 

so we can write 

2 "-l 2 n —1 

H n = Y Y Hnihj^y 3 , ( 12 ) 

i=—(2 n —1) j=—(2" —1) 

where the coefficient H n (i,j) E GF(2) gives the state of the cell (i, j) at generation n. 

From (10) and (4), P n +i is the sum (in TV) of eight copies of H n . translated by 2 n in each of 
the N, NW, W, SW, S, SE, E, and NE directions. That is, for n > 1, 

H n+ 1 (i,j) = H n (i, j - 2 n ) + H n [i - 2 n ,j - 2 n ) + H n (i - 2 n ,j) + H n {% - 2 n ,j + 2 n ) 

+ H n (i,j + 2 n ) + H n (i + 2, n ,j + 2 n ) + H n (i + 2 n ,j) + H n (i + 2 n ,j — 2 n ), (13) 

where we adopt the convention that = 0 unless i and j satisfy (11). Also, 

Pq(0, 0) = 1, H 0 (i,j) = 0 for (i,j) ^ (0,0), (14) 


and H] (i, j) = 0 except for 

Pi(0,l) = Pi(-l,l) = Pi(-l,0) = Pi(-l,-l) 

= Pi(0, -1) = Pi(l, -1) = Pi(l, 0) = Pi(l, 1) = 1. (15) 

By construction, H n is preserved by the action of the dihedral group of order 8 (the symmetry 
group of the square), generated by the action of (x,y) -H- (y,x) and (x,y) E4 (y,y). We study H n 
by breaking it up into the central cell, the four parts on the axes, and the four quadrants. 

The central cell. The central cell P ra (0,0) = 1 if n = 0, and (as a consequence of the 8-fold 
symmetry) is 0 for n > 0. The axial parts. We define X n [n > 1) to be the portion of H n that 
lies on the positive ;c-axis, but normalized so that its center is at the origin: 

2 n —1 

x n := £ H n (i,0)x\ (16) 

X . 

2=1 

For example, X\ = 1, X 2 = \ + x, X 3 = ^ ^ + x + x 3 . From (13) it follows by induction that, 

for n > 2, 

2 n—2 —1 , \ 2 "- 1 -! 

X n = J2 (s _(2i+1) + ® M+1 ) = Y+x) . (17) 

i=0 ^ x ' 
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Similarly, the portion of H n that lies on the negative x-axis, normalized so that its center is at the 
origin, is 

_ 2 n -i 

X n := x 2 " 1 H n (-i,0)x~' 1 = y~ + x J = x n- (18) 

Likewise, the normalized portions of H n on the positive and negative y-axes are 


Y — Y — 

1 n — 1 n. — 



(19) 


The four quadrants. Next, define I n for n > 1 to consist of the portion of H n lying in the first 
quadrant, again normalized so that its center is at the origin: 


2 n — 1 2"-l 


In : = 


(xy) 


Y Y HnihjW 


i =1 3 =1 


Similarly, we define 


I In := I - 


2 n ~ 1 2 n — l2 n —l 


Y Y H n(~hj) x V, 


i =1 j =1 
2 n —1 2 n —l 

III n := (xy) 2 " 1 Y 

i=i j =i 


x l y \ 


IV n := 


Y Y H n(i,-j)x l y 3 ■ 

*= 1 3 =1 


we 

see that, for n 

> 1, 

H n = 




(y/x) 2 - 1 II n 

+ 

y^Vn 

+ 

1 ? 

to 

3 

1 

1—1 

3 

+ 

(l/x) 2 "- 1 V n 

+ 

0 

+ 

x 2 X n 

+ 

(xy)- 2 "- 1 III„ 

+ 

y I n 

+ 

{x/y) 2n l IVn 


which we write as a matrix 


H n = 


TT Y T ' 

x n o x n 

III n V n IVn. 


( 20 ) 


( 21 ) 


( 22 ) 


where it is to be understood that the blocks are to be shifted by the appropriate amounts (that is, 
the I n in the top right corner is to be multiplied by (xy) 2 " , and so on). By summing the eight 

translated copies of H n , as in (13), we obtain 


' IIn 

Y n 

In + Hn 

Y n 

In V Hn 

Y n 

In 

X n 

0 

X n + X n 

0 

X n + X n 

0 

X n 

e 

i—i 
i—i 
►—1 

+ ' 
e 

i—i 

HH 

Y n + Y n 

I n + IIIn + IVn 

Yn 

Hn +IHn + IVn 

Yn+% 

In T IV5 

X n 

0 

X n 

0 

X n 

0 

X n 

Hn + IHn 

Y n + Y n 

I n + Hn + IV n 

Y n 

I n + Hn + IHn 

Y n + Y n 

In + IV, 

X n 

0 

x n + x n 

0 

X n + X n 

0 

V n 

. IHn 

Yn 

IIIn + IVn 

Y n 

IIIn + IVn 

Y n 

IVn 
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Using (18) and (19), we have 



i 

i—i 
i—i 

3 

Y n 

Ira + Hra 


Ira + Hra 

Y n 

1- 

£ 
i—i 

Hn+1 — 


0 

0 

j U+1 — 

0 

0 

X n 


Iln+IUn 

0 

Ira + IIIra + IVra 


lira T Ulra T IVra 

0 

Ira T IV n 


( 24 ) 



'Iln + IUn 

0 

£ 

> 
I—I 

+ 

£ 
i—i 

l-H 

+ 

£ 

1—1 


£ 
i—i 
i—i 

HH 

+ 

£ 
i—i 

l-H 

+ 

£ 

| i—i 

0 

1 

£ 

> 
i—i 

+ 

£ 
i—< 

II 

+ 
£ 
i—i 
i—i 

l-H 

X n 

0 

0 

, IV n+ i = 

0 

0 

X n 


£ 
i—i 

HH 

1—1 

_1 

Y n 

i 

£ 

> 

H-1 

+ 

£ 
I—I 

H—1 

1—1 


IIIra + IYn 

Y n 

IVra 


(25) 


By adding these four matrices we find that I n+ i + II n +i + III n +i + IVn+i = 0 for n > 1. This 
identity is also true for n = 0, and we conclude that 


In + lira + Ulra + IV n — 0, Tl > 1, 


and so 




lira 

V n 

Ira + Ura 



Ira T Ura 

V n 

£ 
i—i 

1—1 
1—1 
£ 

+ 

II 


X n 

0 

0 


Ira+1 — 

0 

0 

X n 


Hra + IHra 

0 

Ura 



In 

0 

Ira + IV 

etc., and finally that, 

for n > 1 










' lira 

Yn 

In + Ura 

Y n Ira + Ura 

Y n 

Ira 



X n 

0 

0 


0 

0 

0 

X n 



II„ + IIIra 

0 

Ura 


Y n 

Ira 

0 

Ira + IV 

Hn +1 — 


X n 

0 

X n 


0 

X n 

0 

X n 



lira + IHra 

0 

IHra 


Y n 

IV„ 

0 

Ira + IV 



X n 

0 

0 


0 

0 

0 

X n 



IHra 

Y n 

Illra+IVn 

Y n Ulra T IVra 

Y n 

I v n 


(26) 


(27) 


(28) 


In (28) we see that H n +i contains a copy of H n at its center. The four corner blocks together with 
two copies each of X n and Y n form another, “deconstructed”, copy of H n . 

Suppose n > 2, and consider the blocks [I„ +II„ Y n l n + II n Y n ] in the top row of (28). Using 
(27) these blocks can be expanded to give 


Ira—1 

0 

Ura—1 

Y n - 1 

Ira—1 

0 

Ura—1 

Y n -1 


X n - 1 

0 

V n _i 

0 

X n -1 

0 

V n _, 

0 

(29) 

IV n _l 

0 

IHra-1 

Y n - 1 

IV„_1 

0 

IIIra-1 

Y n - 1 



The central three columns (columns 3, 4, and 5) give a copy of H n _ i, and columns 7, 8, and 1, in 
that order, give another copy. We get two further copies of H n ~i from the analogous blocks in each 
of the other three sides of (28), so in total H n +i contains as many ON cells as are in two copies of 
H n plus eight copies of H n - 2 - This implies (9) for n > 2. Equation (9) is certainly true for n = 1, 
so this completes the proof of the theorem. □ 

The characteristic polynomial of (9) is x 2 — 2x — 8 = (x + 2)(x — 4), and it follows that 

5.4 n + ( -2 ) n+1 

On — 


3 


n > 0. 


(30) 




















Figure 3: Dissection of H 4 into pieces that can be reassembled to give two copies of H 3 and eight 
copies of H 2 , illustrating (9) in the case n = 3. 


In geometric terms, the proof of Theorem 4 shows that H n+ \ can be dissected into pieces that 
can be reassembled to give two copies of H n and eight copies of H n _ Figure 3 shows this dissection 
in the case of H±. The central dashed square (colored red in the online version) encloses a copy 
of H3. The smaller dashed squares on the four sides (colored blue) enclose copies of H2. The four 
pairs of vertical parallel lines (colored green) enclose copies of I 3 , and the four pairs of horizontal 
parallel lines (also green) enclose copies of X3. The four corners (outside the parallel lines) are, 
reading counter-clockwise from the top right corner, respectively I 3 , II 3 , III 3 , and IV 3 , and combine 
with two copies each of X3 and I 3 to give the second copy of H3. Along the top edge, the figure is 
divided into seven pieces, as in the top row of (28). By taking the fifth, sixth, and third pieces in 
that order gives another copy of H 2 , and three further copies are obtained from the other edges of 
the figure. 

From Theorem 3 and (9), we see that a n is given by the recurrence 

« 2 1 = a t , (Dm = 8 at, 04 t _)_3 = 2 a, 2 t+i + 8 at, (31) 

for t > 0, with ao = 1, as conjectured by Hrothgar [10]. 

5 The centered von Neumann neighborhood 

In this section we analyze the two-dimensional odd-rule CA defined by the five-celled neighborhood 

F = - + l + x+- + y G TZ = GF(2)[x,x~ 1 ,y,y~ 1 } (32) 

x y 
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shown in Fig. l(vi), and consisting of the von Neumann neighborhood together with its center. 
(This is the five-neighbor totalistic Rule 614.) We use the same notation as in the previous section, 
except that now F is defined by (32) instead of (1). 

The initial values of [a n ,n> 0] are 

1, 5, 5,17, 5, 25,17, 61, 5, 25, 25, 85,17,85, 61, 217,... 

(A072272), which we know from Theorem 2 is the run length transform of the subsequence [b n ,n > 
0] = 1, 5,17,61, 217, 773, 2753,..., shown in bold (A007483). Generations 0 through 8 are shown in 
Fig. 4, and Figs. 5 and 6 show generation 31. 


■ 

+ 

■ 

■ ■ ■ 

■ 

❖ 

■ 

■ ■ ■ 

■ 

+ 

++ + 

+ 

■ 

■ ■ ■ 

■ ■ 

■ ■ ■ ■ ■ 

■ ■ 

■ ■ ■ 

■ 

< 0 - 

■ 

■ ■ ■ 

■ 


Figure 4: Generations 0 through 8 of the odd-rule CA defined by the centered von Neumann 
neighborhood. Generations 0 , 1, 3, 7 show Hq, H i, Ho , Ho, respectively. 


Theorem 5. The sequence [b n ,n > 0] satisfies the recurrence 

b n+ 1 = 3 b n + 2b n -i, with b 0 = 1, bi = 5. (33) 


Proof. As in the proof of Theorem 4, H n = F 2 " -1 , b n = \H n \, and again H n is preserved under the 
action of a dihedral group of order 8 . The first step in the proof is to show that for n > 2, H n can 
be dissected into a central copy of 77 n _2 and four disjoint pentagonal or “haystack”-shaped regions 
(see Fig. 5 for the dissection of Hfi). The four haystacks in H n will be denoted by N n , W n , S n . and 
E n . according to the direction in which they point (a precise definition will be given below). They 
are equivalent under the action of the dihedral group. Algebraically, we will show that 


H n — 77(1-2 + U 2 N n 


+ x 


— 2 " 


hF n + y 


—2 n 


S n +X 1 


E„ 


(34) 
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Figure 5: Generation 31 (Hr,), showing dissection into central copy of H 3 (inside red dashed line) 
and four “haystacks” IV 5 , W 5 , S 5 , E$ (each enclosed by blue dashed line). 


where the five polynomials on the right are disjoint (i.e., have no monomials in common with each 
other). 

Once we have established (34), the second step in the proof will be to show that each haystack 
can be dissected into five smaller haystacks (see Fig. 6 for the dissection of IV 5 ). In particular, we 
will show that for n > 3, 


071 — 2 071 — 2 O 

N n = y N n - 1 + x " W n -\ + x 


n—2 _oti—3 

E n —i + Ef n ~ 2 + y S n - 2 , 


(35) 


where again the polynomials on the right are disjoint. Let v n := |lV n | = \W n \ = \S n | = \E n \. Then 
(35) implies u n = 2>v n _\ + 2z/ n _2. From (34) we have b n = b n _ 2 + 4z/ n , so 


b n+ 1 - 3 b n - 26 n _i = b n -1 - 3 b n -2 - 2b n -3 = ■■■ = either 63 - 362 - 2 b\ or 6 2 - 36i - 2 6 0 , 

and each of the last two expressions evaluates to zero. This will complete the proof of (33). It is 
worth remarking that these dissections are of a different nature from the dissection in the previous 
section. There it was necessary to make some non-obvious cuts through the contiguous blocks of 
ON cells, as shown by the parallel (green) lines in the corners of Fig. 3. In contrast, in the present 
proof, the dissections are carried out by “tearing” along the obvious “perforations”, rather like 
tearing apart a block of postage stamps. 

Now to the details. It follows from the definition (see the sequence of successive states in Fig. 4) 
that H n is a diamond-shaped configuration with extreme points (±(2 n — 1), 0), (0, ±(2 n — 1)). Also, 
for n > 2, H n contains a copy of H n - 2 at its center, surrounded by a layer, at least one cell wide, 
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Figure 6 : Another view of generation 31 (H§) showing dissection of haystack IV 5 into three smaller 
haystacks JV 4 , W 4 . E 4 (each enclosed in green dashed line) and two still smaller haystacks IV 3 and 
S 3 (inside blue dashed lines). 


of OFF cells. This follows from the identity 

H n -H n -2 = F 2 "- 1 - F 2 "- 2 - 1 = H n . 2 (l + Hf- 2 ), 

upon checking that the right-hand side contains no monomials x l y J with |i| + |j| < 2 n_2 . The buffer 
layer of OFF cells around the central H n _ 2 consists of the cells x i y^ with |i| + \j\ = 2 n ~ 2 . 

We define the rath North haystack to be 

N n '■= H n _ 2 Tn , n > 2, (36) 

where T^v := 1/x + l + x + y is the four-celled North-pointing triangle shown in Fig. l(iii). Similarly, 
the West, South, and East haystacks are 

W n := H n - 2 T'w~ 2 , Sn := H n - 2 Tf~\ E n := H n - 2 T%~\ (37) 

where T\y := 1/x + l/y +1 + y is a West-pointing version of Tjy, and similarly for Ts and Te- (The 
simple expressions in (36) and (37) were guessed by computing the actual haystacks in H 3 to Hq 
and using Maple to factor them in 1Z.) The haystack N n has the property that all its cells are on 
or inside the convex hull of the five cells x l y J with (i,j) equal to 

(0, 2 n ~ l - 1), (—2 n_1 + 1, 0), (—2 n_2 , —2 n_2 + 1), (2 n-2 - 2 n_2 + 1), (- 2 n ~ l + 1,0). (38) 

To see this, consider what happens to N n one generation later: it becomes 

FN n = {FT N f n ~\ (39) 
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using the Freshman’s Dream (4), where FT n is the six-celled configuration 


y + 


1 1 x 

H-1-1- \~ x 

xy y y 


2 


From (39), FN n is this configuration with the cells moved 2 n ~ 2 steps apart, and the cells (38) lie 
just inside it. 

Note that the point (0, 0) is in the interior of N n at the intersection of the vertical line through 
the apex and the horizontal line joining the most Western and Eastern points. The powers of x 
and y in (34) and (35) are needed in order to translate the haystacks into their correct positions. 
Since we now know the boundaries of all the terms on the right side of (34), we can check that 
these five polynomials are indeed disjoint. We must still check that (34) is an identity. Using the 
Freshman’s Dream, this reduces to checking the identity 


#2 = 1 + V 2 Tn + X 2 T W + y 2 T s + x 2 T e , 


which is true. This completes the proof that (34) is a proper dissection of H n . The correctness of 
the dissection (35) is verified in a similar way; we omit the details. □ 


6 Other Cellular Automata 

6.1 The 256 one-dimensional rules 

There are 256 possible CAs based on the one-dimensional three-celled neighborhood shown in 
Fig. l(i). These are the CAs labeled Rule 0 through Rule 255 in the Wolfram numbering scheme 
[17, 24, 26]. As usual we assume the automaton is started with a single ON cell, and let a n denote 
the number of ON cells after n generations. Illustrations of the initial generations of all 256 CAs 
are shown on pages 54-56 of [26]. Many of these sequences were analyzed in [23]; see also [22]. If 
we eliminate those in which some a n is infinite, or the sequence [a n ,n > 0 ] is trivial (essentially 
linear), or is a duplicate of one of the others, we are left with just seven sequences: 

• Rule 18 (or Rule 90; Rule 182 is very similar): a n = 2 wt (”) (Gould’s sequence A001316), where 
wt(n) is the number of Is in the binary expansion of n. This is the run length transform of 
the powers of 2 . 

• Rule 22: a n = 2 wt W if n even, 3 . 2 wt +)-i if n odd (A071044). 

• Rule 30: The behavior appears chaotic, even when started with a single ON cell [22, 26]. The 
sequence, A070952 , has roughly linear growth, but it seems likely that there is no simpler 
way to obtain it than by its definition. 

• Rule 62: a n+ 7 = a n+ 4 + a n+ s — a n with initial terms 1 , 3, 3, 6 , 5, 8 , 9 (A071047). 

• Rule 110: Although the initial behavior is chaotic, it is an astonishing fact, pointed out by 
Wolfram [26, p. 39], that after about three thousand terms all the irregularities disappear. 
By using the Salvy-Zimmermann gfun package in Maple [18], we find that the sequence, 
A071049 . satisfies a linear recurrence of order 469: for n > 2854, 

On+469 = —fln+453 + «n+256 + 0-n+ 240 + On+229 + On+213 _ «n+16 _ «n- (40) 

This recurrence is far nicer than it initially appears: the coefficients are palindromic, and its 
characteristic polynomial is the product of 25 irreducible factors. 
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• Rule 126: a n = 2 wt ( n ) +1 , except we must subtract 1 if n = 2 k — 1 for some k (A071051). 

• Rule 150: This is the odd-rule CA defined by the three-celled neighborhood. The sequence 
[a n ,n > 0] (A071053) was analyzed by Wolfram [23] (see also [9], [19])- In the notation of the 
present paper [a n ,n > 0] is the run length transform of the Jacobstahl sequence A001045 . 
Theorems 1 and 2 were suggested by reading Sillke’s analysis [19]. 

6.2 Other odd-rule CAs 

In this section we discuss the odd-rule CAs defined by the height-one neighborhoods in Fig. 1. 
(The height-two neighborhood of Fig. l(iv) is discussed in the last section of the paper.) Table 
2 summarizes the results. The first column specifies the neighborhood F in Fig. 1, a n (F) is the 
number of ON cells at generation n, b n (F) denotes the sequence of which a n (F ) is the run length 
transform, and the fourth column gives a generating function (g.f.) for b n (f). For Fig. (iii), Fib n _|_2 
denotes a Fibonacci number. The g.f. for (vii), found by Doron Zeilberger [3], is 

(1 + 2 x) (l + x — x 2 + x 3 + 2 x 5 ) 

1 — 3 x — 3 x 2 + x 3 + 6 x 4 — 10 x 5 + 8 x 6 — 8 x 7 ' 

An expanded version of this table, analyzing all the sequences arising from odd-rule CAs defined 
by height-one neighborhoods on the square grid, will be published elsewhere [4]. 


Table 2: Odd-rule CAs defined by height-1 neighborhoods F in Fig. 1. 


F 

a n {F) 

b n {F) 

g-f- 

Notes 

(i) 

(ii) 

(iii) 

(v) 

(vi) 

(vii) 
(viii) 

(ix) 

(x) 

A071053 

A048883 

A253064 

A001045 

A000244 

A087206 

l+ 2 x 

l—x— 2 x 2 

1 

1— 3 a: 
l+2a; 

1—2a;— 4 a; 2 

1 

1— 4 x 
l+ 2 x 

l- 3 x- 2 x ' 2 

(41) 

{l+ 2 x)(l+ 2 x+Ax 2 ) 
l— 3 a:—8a; 3 —8a; 4 
I+62 

I-22-82 2 
I+62—82 2 

Rule 150, §6.1 
a n = 3 wt ( n ) 
b n = 2 n Fib n+ 2 
a n = 4 wt ( n ) 

§5 

Fredkin Replicator, §4 
Squares of entries from (i) 

A102376 

A072272 

A253069 

A246039 

A160239 

A246035 

A000302 

A007483 

A253070 

A246038 

A246030 

A139818 

(l-s)(l+ 2 s)(l- 4 s) 


6.3 Further two-dimensional CAs 

If we drop the “odd-rule” definition, the number of CAs grows astronomically—there are 2 512 
based on the Moore neighborhood alone. Pages 171-175 of [26] show many examples of the subset 
of “totalistic” rules, in which the next state of a cell depends only on its present state and the total 
number of ON cells surrounding it. All of these are potential sources of sequences. In a few cases 
it is possible to analyze the sequence, but usually it seems that no formula or recurrence exists. 
In this section we give three examples: one that can be analyzed, one that might be analyzable 
with further research, and one (typical of the majority) where the state diagrams are aesthetically 
appealing but finding a formula seems hopeless. All three are totalistic rules, the first two being 
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based on the von Neumann neighborhood (Fig. l(v)), and the third on the Moore neighborhood 
(Fig. l(ix)). 

The first example is the Rule 750 automaton, in which an OFF cell turns ON if an odd number 
of its four neighbors are ON, and once a cell is ON it stays ON [26, p. 925]. This CA is a hybrid 
of the “odd-rule” CAs studied above and the “once a cell is ON it stays ON” rules studied in [2], 
Here it is convenient to call the initial ON cell generation 1 (rather than 0). The numbers of ON 
cells in the first few generations (A169707) are given in Table 3. 


Table 3: Number of ON cells at nth generation of CA defined by Rule 750. 


n 


d n 


1 1 


2 

- 3 

5 

9 






4 

- 7 

21 

25 

37 

57 




8 

- 15 

85 

89 

101 

121 

149 

169 

213 281 

16 

- 31 

341 

345 

357 

377 

405 

425 

469 ... 


The evolution of this CA is similar to several that were studied in [2]: at generation 2 k , for 
k > 2, the structure is enclosed in a diamond-shaped region, which is saturated in the sense that 
no additional interior cells can ever be turned ON, and contains a 2 k = (4 fc+1 — l)/3 ON cells. 
Then in generations 2 k + 1 to 2 k+1 — 1, the structure grows outwards from the four vertices of 
the diamond, and the first half of the growth that follows generation 2 k is the same as the the 
growth that followed generation 2 k ~ 1 . Figure 7 shows generation 20 = 2 4 + 4, where we can see 
that 16 cells have grown out of each vertex. Pictures of generations 16 = 2 3 + 4 and 36 = 2 5 + 4 
show exactly the same growth from the vertices (although with different numbers of ON cells in 
the central diamond). 



Figure 7: Generation 20 of CA defined by Rule 750, showing 341 + 4(1 + 3 + 5 + 7) = 405 ON cells, 
illustrating (43). 
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The successive numbers of ON cells added to a vertex in the generations from 2 k to 2 k+l — 1 
are 0 , 1 , 3, 5, 7, 5,11,17,15, 5,..., which form the initial terms uo, v\, vi, ... of a sequence ( A151548 ) 
encountered in [2], The v t have generating function 

OO 

zr^— + 4x 2 rf(l + x 2r - 1 11 + 2a: 2r ). (42) 

1 + x - LJ ; 

r= 1 

Then, for k > 0 and 0 < m < 2 k , we have 

4^+1 _ ^ 777 ' 

a 2 k +m = n + v i- ( 43 ) 

i =0 

Bearing in mind the warning in the first sentence of this paper, we must admit that we have not 
written out a complete proof that (43) is correct. However, there should be no difficulty in filling 
in the details: as the automaton evolves from generation 2 k to 2 k+1 , the structure has a natural 
dissection into polygonal pieces. 

The second example is more speculative: this is the Rule 493 automaton [26, p. 173], A246333 . 
The binary expansions of 493 and 750 differ in just four places, so it is not surprising that this 
is similar to the previous example. Now an ON cell stays ON unless exactly zero or four of its 
neighbors are ON, in which case it turns OFF, and an OFF cell turns ON unless exactly two of 
its neighbors are ON. Assuming here that we start with a single ON cell at generation 0, in the 
even-numbered generations the number of ON cells is finite (A246334): 

1, 5,17, 29, 61, 73,109,157, 229, 241, 277, 329,429,477, 573, 633, 861, 

while in the odd-numbered generations the number of OFF cells is finite (A246335): 

1, 5, 9, 21, 25, 37, 57,85, 89,101,121,165,169, 213, 217, 317,321, 333,.... 

The reason for hoping this automaton might be analyzable is that the latter sequence agrees with 
the sequence in Table 3 up though the eleventh term, 121, after which the sequences diverge. Even 
the respective states are the same up through the sixth term, 37, although to see this one has to 
work with the negatives—in the photographer’s sense, interchanging black and while cells—and 
then rotating the result by 45 degrees. This needs further investigation. 

The third example in the eight-neighbor Rule 780 ( A246310 ), in which a cell turns ON if one 
or four of its neighbors in ON, and otherwise turns OFF. Although the initial generations are 
simple enough, already by generation 15 (Fig. 8 ) the structure is extremely complicated. Is there 
a recurrence? Is the five-neighbor analog (A253086) any easier to understand? 

6.4 The three-dimensional analog of Fredkin’s Replicator 

The three-dimensional Moore neighborhood, that is, the 3x3x3 cube without its center cell, gives 
rise to the sequence 1, 26, 26,124, 26,676,124,1400,... (A246031), which by Theorem 2 is the run 
length transform of the subsequence 

1, 26,124,1400,10000,89504, 707008, 5924480,47900416, 393069824, 3189761536, 25963397888,... 

(44) 

( A246032 ). computed by Roman Pearce and Michael Monagan. Doron Zeilberger [3] has found a 
generating function, a rational function with numerator of degree 10 and denominator of degree 

11 , as well as a proof that it is correct. 
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Figure 8 : Generation 15 of CA defined by eight-neighbor Rule 780. 


7 Further remarks about run length transforms 

Block structure. It is a surprising fact that the growth sequences of many CAs have a natural 
division into blocks of successive lengths ( 1 , ) 1 , 2,4,8,16, 32,.... This is true even for some CAs 
that are defined on lattices other than [2]. Some of these examples are explained by the fact 
that the run length transform always has this property—the division into blocks of the run length 
transform [T n , n > 0] of an arbitrary sequence S = [1, A, B , C,D,...} is shown in Table 4. Table 1 
above gives a concrete example. The first half of each row is given by A times the beginning of the 
[T n \ sequence itself. 


Table 4: The run length transform [T n , n > 0] of a sequence S = [1, A , B , C, D,.. .], showing the 
division into blocks of sizes 1 , 1 , 2 ,4, 8,16,.... 


n 

T n 


0 

1 









1 

A 








2 

- 3 

A 

B 







4 

- 7 

A 

A 2 

B 

C 





8 

- 15 

A 

A 2 

A 2 

AB 

B 

AB 

C 

D 

16 

- 31 

A 

A 2 

A 2 

AB 

A 2 

A 3 

AB 

AC B AB AB B 2 C AC D E 

32 

- 63 

A 

A 2 

A 2 

AB 

A 2 

A 3 

AB 

AC ... 


Further examples. We briefly mention four additional examples of run length transforms. 
The run length transform of 0,1, 2 ,3, 4,... is 1,1,1, 2 , 1 , 1 , 2, 3,1,... ( A227349 ), which gives the 
product of the lengths of runs of Is in the binary representation of n. The primes, prefixed by 
1 , give 1 , 2 , 2 , 3, 2 ,4,3, 5, 2 ,... (A246029)- The squares give 1 , 1 ,1,4, 1 , 1 , 4, 9,... (A246595)- The 
powers of 2 give 1, 2, 2,4, 2,4,4, 8, 2,..., 2 wt ( n ) ( ^001316 , already mentioned in §6.1). 

Graphs. The graphs of run length transforms are usually highly irregular, as one expects 
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from Table 4. The partial sums of these sequences are naturally smoother, and generally have 
a family resemblance, with a bumpy appearance somewhat similar to what is seen in the Tagaki 
curve [1], [12]. The partial sums of the four examples in the previous paragraph are A253083 , 
A253081 . A253082 . A006046 . respectively, and the partial sums of the sequences arising from 
Fredkin’s Replicator, the sequence in Sect. 5, and the Rule 150 sequence in Sect. 6 are respectively 
A245542 , A253908 . A134659 . 2 The latter sequence is discussed in [6], and it would be interesting 
to see if the methods of that paper can be applied to the other six sequences. Also, is there any 
direct connection between the limiting form of these graphs for large n and the Tagaki curve? 

The generalized run length transform. There are analogs of Theorem 2 which apply to 
larger neighborhoods, although they are more complicated and not as useful. The following is a 
version which applies when the neighborhood F has height at most 2. Whereas in Theorem 2, a n (F ) 
was expressed as a product of terms from the subsequence a m (F) where the binary expansion of m 
contained no zeros, now we need the values a m (F) where m is any number whose binary expansion 
begins and ends with 1 and does not contain any pair of adjacent zeros. These are the numbers 
( A247648 ) 


1, 3, 5, 7,11,13,15, 21, 23, 27, 29, 31,43,45,47, 53, 55, 59, 61, 63,. 


(45) 


Suppose for simplicity that the binary expansion of n has the form 


n 



mi, m 3 > 1 , m 2 > 2 , 


where the asterisks indicate strings of Os and Is that begin and end with Is and do not contain 
any pair of adjacent zeros. If the first such string represents N\ and the second N 2 , then a n {F) = 
a]y 1 (F)aN 2 (F). There is an analogous expression in the general case, expressing a n (F) as a product 
of terms a m (F ) where m belongs to (45). 

To illustrate, suppose F is the five-celled one-dimensional neighborhood shown in Fig. l(iv), 
with height 2. The initial values of a n (F ) are given in Table 5, with a m {F) for m in (45) shown 
in bold. For example, the binary expansion of 167 is 10100111, so the generalized run length 
transform tells us that ai 67 (T) = as(F)aT(F) = 17.19 = 323. It follows from the generalized run 
length transform property that in each row of the table, the first one-eighth of the terms coincide 
with 5 times the beginning of the sequence itself. 


Table 5: Number of ON cells at nth generation of of odd-rule one-dimensional CA defined by a 
5-celled neighborhood (A247649). 


n 

On 

0 

1 

1 

5 

2-3 

5 7 

4-7 

5 17 7 19 

8-15 

5 25 17 19 7 31 19 25 

16 - 31 

5 25 25 35 17 61 19 71 ... 


The bold-faced entries in Table 5 form A253085 , and we end with one last question: is there an 
independent characterization of this sequence? An affirmative answer might make the generalized 
run length transform a lot more interesting. Much remains to be done in this subject! 

2 The “graph” button in [16] makes it easy to compare these graphs. However, it is not clear how the growth rate 
of the original sequence affects the “bumpiness” of the partial sums. 
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Postscript, March 2015 


After seeing an initial version of this paper, Doron Zeilberger observed that it is possible to use 
Theorems 1 and 2 to automate calculation of sequences giving the number of ON cells in odd-rule 
CAs, and in the case of height-one neighborhoods, to find and rigorously prove the correctness of 
generating functions for the sequences of which they are the run length transforms. Details will 
appear elsewhere [3, 4]. 
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